39 research outputs found

    Mitochondrial metagenomics: letting the genes out of the bottle

    Get PDF
    ‘Mitochondrial metagenomics’ (MMG) is a methodology for shotgun sequencing of total DNA from specimen mixtures and subsequent bioinformatic extraction of mitochondrial sequences. The approach can be applied to phylogenetic analysis of taxonomically selected taxa, as an economical alternative to mitogenome sequencing from individual species, or to environmental samples of mixed specimens, such as from mass trapping of invertebrates. The routine generation of mitochondrial genome sequences has great potential both for systematics and community phylogenetics. Mapping of reads from low-coverage shotgun sequencing of environmental samples also makes it possible to obtain data on spatial and temporal turnover in whole-community phylogenetic and species composition, even in complex ecosystems where species-level taxonomy and biodiversity patterns are poorly known. In addition, read mapping can produce information on species biomass, and potentially allows quantification of within-species genetic variation. The success of MMG relies on the formation of numerous mitochondrial genome contigs, achievable with standard genome assemblers, but various challenges for the efficiency of assembly remain, particularly in the face of variable relative species abundance and intra-specific genetic variation. Nevertheless, several studies have demonstrated the power of mitogenomes from MMG for accurate phylogenetic placement, evolutionary analysis of species traits, biodiversity discovery and the establishment of species distribution patterns; it offers a promising avenue for unifying the ecological and evolutionary understanding of species diversity

    Intraspecific genetic variation in complex assemblages from mitochondrial metagenomics: comparison with DNA barcodes

    Get PDF
    Metagenomic shotgun sequencing, using Illumina technology, and de novo genome assembly of mixed field-collected amples of invertebrates readily produce mitochondrial genome sequences, allowing rapid identification and quantification of species diversity. However, intraspecific genetic variability present in the specimen pools is lost during mitogenome assembly, which limits the utility of ‘mitochondrial metagenomics’ for studies of population diversity. 2. Using 10 natural communities (>2600 individuals) of leaf beetles (Chrysomelidae), DNA variation in the mitochondrial cox1-5’ ‘barcode’ was compared for Sanger sequenced individuals and Illumina shotgun sequenced specimen pools. 3. Generally, only a single mitochondrial contig was assembled per species, even in the presence of intraspecific variation. Ignoring ambiguity from the use of two different assemblers, the cox1 barcode regions from these assemblies were exact nucleotide matches of a Sanger sequenced barcode in 90.7% of cases, which dropped to 76.0% in assemblies from samples with large intra and interspecific variability. Nucleotide differences between barcodes from both data types were almost exclusively in synonymous 3rd codon position, although the number of affected sites was very low, and the greatest discrepancies were correlated with poor quality of Sanger sequences. 4. Unassembled shotgun reads were also used to score single nucleotide polymorphisms and to calculate intraspecific nucleotide diversity (pi) for all available populations at each site. These values correlated with Sanger sequenced cox1 variation but were significantly higher. 5. Overall, the assemblage-focused shotgun sequencing of pooled samples produced nucleotide variation data comparable to the well-established specimen-focused Sanger approach. The findings thus extend the application of mitochondrial metagenomics of complex biodiversity samples to the estimation of diversity below the species level

    Bulk de novo mitogenome assembly from pooled total DNA elucidates the phylogeny of weevils (Coleoptera: Curculionoidea)

    Get PDF
    Complete mitochondrial genomes have been shown to be reliable markers for phylogeny reconstruction among diverse animal groups. However, the relative difficulty and high cost associated with obtaining de novo full mitogenomes have frequently led to conspicuously low taxon sampling in ensuing studies. Here, we report the successful use of an economical and accessible method for assembling complete or near-complete mitogenomes through shot-gun next-generation sequencing of a single library made from pooled total DNA extracts of numerous target species. To avoid the use of separate indexed libraries for each specimen, and an associated increase in cost, we incorporate standard polymerase chain reaction-based “bait” sequences to identify the assembled mitogenomes. The method was applied to study the higher level phylogenetic relationships in the weevils (Coleoptera: Curculionoidea), producing 92 newly assembled mitogenomes obtained in a single Illumina MiSeq run. The analysis supported a separate origin of wood-boring behavior by the subfamilies Scolytinae, Platypodinae, and Cossoninae. This finding contradicts morphological hypotheses proposing a close relationship between the first two of these but is congruent with previous molecular studies, reinforcing the utility of mitogenomes in phylogeny reconstruction. Our methodology provides a technically simple procedure for generating densely sampled trees from whole mitogenomes and is widely applicable to groups of animals for which bait sequences are the only required prior genome knowledge

    An efficient and robust laboratory workflow and tetrapod database for larger scale environmental DNA studies

    Get PDF
    BACKGROUND: The use of environmental DNA for species detection via metabarcoding is growing rapidly. We present a co-designed lab workflow and bioinformatic pipeline to mitigate the 2 most important risks of environmental DNA use: sample contamination and taxonomic misassignment. These risks arise from the need for polymerase chain reaction (PCR) amplification to detect the trace amounts of DNA combined with the necessity of using short target regions due to DNA degradation. FINDINGS: Our high-throughput workflow minimizes these risks via a 4-step strategy: (i) technical replication with 2 PCR replicates and 2 extraction replicates; (ii) using multi-markers (12S,16S,CytB); (iii) a "twin-tagging," 2-step PCR protocol; and (iv) use of the probabilistic taxonomic assignment method PROTAX, which can account for incomplete reference databases. Because annotation errors in the reference sequences can result in taxonomic misassignment, we supply a protocol for curating sequence datasets. For some taxonomic groups and some markers, curation resulted in >50% of sequences being deleted from public reference databases, owing to (i) limited overlap between our target amplicon and reference sequences, (ii) mislabelling of reference sequences, and (iii) redundancy. Finally, we provide a bioinformatic pipeline to process amplicons and conduct PROTAX assignment and tested it on an invertebrate-derived DNA dataset from 1,532 leeches from Sabah, Malaysia. Twin-tagging allowed us to detect and exclude sequences with non-matching tags. The smallest DNA fragment (16S) amplified most frequently for all samples but was less powerful for discriminating at species rank. Using a stringent and lax acceptance criterion we found 162 (stringent) and 190 (lax) vertebrate detections of 95 (stringent) and 109 (lax) leech samples. CONCLUSIONS: Our metabarcoding workflow should help research groups increase the robustness of their results and therefore facilitate wider use of environmental and invertebrate-derived DNA, which is turning into a valuable source of ecological and conservation information on tetrapods

    Soup to tree: the phylogeny of beetles inferred by mitochondrial metagenomics of a Bornean rainforest sample

    Get PDF
    In spite of the growth of molecular ecology, systematics and next-generation sequencing, the discovery and analysis of diversity is not currently integrated with building the tree-of-life. Tropical arthropod ecologists are well placed to accelerate this process if all specimens obtained via masstrapping, many of which will be new species, could be incorporated routinely in phylogeny reconstruction. Here we test a shotgun sequencing approach, whereby mitochondrial genomes are assembled from complex ecological mixtures via mitochondrial metagenomics, and demonstrate how the approach overcomes many of the taxonomic impediments to the study of biodiversity. DNA from ~500 beetle specimens, originating from a single rainforest canopy fogging sample from Borneo, was pooled and shotgun sequenced, followed by de novo assembly of complete and partial mitogenomes for 175 species. The phylogenetic tree obtained from this local sample was highly similar to that from existing mitogenomes selected for global coverage of major lineages of Coleoptera. When all sequences were combined, only minor topological changes are induced against this reference set, indicating an increasingly stable estimate of coleopteran phylogeny, whilst the ecological sample expands the tip-level representation of several lineages. Robust trees generated from ecological samples now enable an evolutionary framework for ecology. Meanwhile, the inclusion of uncharacterized samples in the tree-of-life rapidly expands taxon and biogeographic representation of lineages without morphological identification. Mitogenomes from shotgun sequencing of unsorted environmental samples and their associated metadata, placed robustly into the phylogenetic tree, constitute novel DNA ‘superbarcodes’ for testing hypotheses regarding global patterns of diversity

    Shifting up a gear with iDNA: From mammal detection events to standardised surveys

    Get PDF
    Invertebrate-derived DNA (iDNA), in combination with high throughput sequencing, has been proposed as a cost-efficient and powerful tool to survey vertebrate species. Previous studies, however, have only provided evidence that vertebrates can be detected using iDNA, but have not taken the next step of placing these detection events within a statistical framework that allows for robust biodiversity assessments. Here, we compare concurrent iDNA and camera-trap surveys. Leeches were repeatedly collected in close vicinity to 64 camera-trap stations in Sabah, Malaysian Borneo. We analyse iDNA-derived mammalian detection events in a modern occupancy model that accounts for imperfect detection and compare the results with those from occupancy models parameterised with camera-trap-derived detection events. We also combine leech-iDNA and camera-trap data in a single occupancy model. We found consistent estimates of occupancy probabilities produced by our camera-trap and leech datasets. This indicates that the metabarcoding of leech-iDNA method provides reasonable estimates of occupancy and may be a suitable method for studying and monitoring mammal species in tropical rainforests. However, we also show that a more extensive collection of leeches would be needed to assess mammal biodiversity with a robustness similar to that of camera traps. As certain taxa were only detected in leeches, we see great potential in complementing camera-trap studies with the iDNA approach, as long as the collection of leeches follows a robust and standardised sampling scheme. Synthesis and applications. Here, we describe an approach to analyse detection records of mammals derived from leech samples using an occupancy framework that accounts for leech-specific factors influencing the detection probability. We further combined camera trap and leech data, which lead to increased confidence in occupancy estimates. Our approach is not restricted to the processing of leech samples, but can be used for the analysis of other invertebrate DNA and environmental DNA data. Our study is the first step to shift the application of invertebrate DNA studies from opportunistic ad-hoc collections to the systematic surveys required for long-term management of wildlife populations

    Metagenome skimming of insect specimen pools: potential for comparative genomics

    Get PDF
    Metagenomic analyses are challenging in metazoans, but high-copy number and repeat regions can be assembled from lowcoverage sequencing by “genome skimming,” which is applied here as a new way of characterizing metagenomes obtained in an ecological or taxonomic context. Illumina shotgun sequencing on two pools of Coleoptera (beetles) of approximately 200 species each were assembled into tens of thousands of scaffolds. Repeated low-coverage sequencing recovered similar scaffold sets consistently, although approximately 70% of scaffolds could not be identified against existing genome databases. Identifiable scaffolds included mitochondrial DNA, conserved sequences with hits to expressed sequence tag and protein databases, and knownrepeatelementsof high and low complexity, includingnumerous copies ofrRNAandhistone genes.Assemblies of histones captured a diversity of gene order and primary sequence in Coleoptera. Scaffolds with similarity to multiple sites in available coleopteran genome sequences for Dendroctonus and Tribolium revealed high specificity of scaffolds to either of these genomes, in particular for high-copy number repeats. Numerous “clusters” of scaffolds mapped to the same genomic site revealed intraand/or intergenomic variation within a metagenome pool. In addition to effect of taxonomic composition of the metagenomes, the number of mapped scaffolds also revealed structural differences between the two reference genomes, although the significance of this striking finding remains unclear. Finally, apparently exogenous sequences were recovered, including potential food plants, fungal pathogens, and bacterial symbionts. The “metagenome skimming” approach is useful for capturing the genomic diversity of poorly studied, species-rich lineages and opens new prospects in environmental genomic

    Uncovering trophic interactions in arthropod predators through DNA shotgun-sequencing of gut contents

    Get PDF
    Characterizing trophic networks is fundamental to many questions in ecology, but this typically requires painstaking efforts, especially to identify the diet of small generalist predators. Several attempts have been devoted to develop suitable molecular tools to determine predatory trophic interactions through gut content analysis, and the challenge has been to achieve simultaneously high taxonomic breadth and resolution. General and practical methods are still needed, preferably independent of PCR amplification of barcodes, to recover a broader range of interactions. Here we applied shotgun-sequencing of the DNA from arthropod predator gut contents, extracted from four common coccinellid and dermapteran predators co-occurring in an agroecosystem in Brazil. By matching unassembled reads against six DNA reference databases obtained from public databases and newly assembled mitogenomes, and filtering for high overlap length and identity, we identified prey and other foreign DNA in the predator guts. Good taxonomic breadth and resolution was achieved (93% of prey identified to species or genus), but with low recovery of matching reads. Two to nine trophic interactions were found for these predators, some of which were only inferred by the presence of parasitoids and components of the microbiome known to be associated with aphid prey. Intraguild predation was also found, including among closely related ladybird species. Uncertainty arises from the lack of comprehensive reference databases and reliance on low numbers of matching reads accentuating the risk of false positives. We discuss caveats and some future prospects that could improve the use of direct DNA shotgun-sequencing to characterize arthropod trophic networks

    Data from: Shotgun mitogenomics across body size classes in a local assemblage of tropical Diptera: phylogeny, species diversity and mitochondrial abundance spectrum

    No full text
    Mitochondrial genomes can be assembled readily from shotgun-sequenced DNA mixtures of mass-trapped arthropods (“mitochondrial metagenomics”), speeding up the taxonomic characterization. Bulk sequencing was conducted on some 800 individuals of Diptera obtained by canopy fogging of a single tree in Borneo dominated by small (<1.5 mm) individuals. Specimens were split into five body size classes for DNA extraction, to equalize read numbers across specimens and to study how body size, a key ecological trait, interacts with species and phylogenetic diversity. Genome assembly produced 304 orthologous mitochondrial contigs presumed to each represent a different species. The small-bodied fraction was the by far most species-rich (187 contigs). Identification of contigs was through phylogenetic analysis together with 56 reference mitogenomes, which placed most of the Bornean community into seven clades of small-bodied species, indicating phylogenetic conservation of body size. Mapping of shotgun reads against the mitogenomes showed wide ranges of read abundances within each size class. Ranked read abundance plots were largely log-linear, indicating a uniformly filled abundance spectrum, especially for small-bodied species. Small-bodied species differed greatly from other size classes in neutral metacommunity parameters, exhibiting greater levels of immigration, besides greater total community size. We suggest that the established uses of mitochondrial metagenomics for analysis of species and phylogenetic diversity can be extended to parameterize recent theories of community ecology and biodiversity, and by focusing on the number mitochondria, rather than individuals, a new theoretical framework for analysis of mitochondrial abundance spectra can be developed that incorporates metabolic activity approximated by the count of mitochondria

    Data from: Phylogenetic community ecology of soil biodiversity using mitochondrial metagenomics

    No full text
    High-throughput DNA methods hold great promise for the study of taxonomically intractable mesofauna of the soil. Here, we assess species diversity and community structure in a phylogenetic framework, by sequencing total DNA from bulk specimen samples and assembly of mitochondrial genomes. The combination of mitochondrial metagenomics and DNA barcode sequencing of 1494 specimens in 69 soil samples from three geographic regions in southern Iberia revealed >300 species of soil Coleoptera (beetles) from a broad spectrum of phylogenetic lineages. A set of 214 mitochondrial sequences longer than 3000 bp was generated and used to estimate a well-supported phylogenetic tree of the order Coleoptera. Shorter sequences, including cox1 barcodes, were placed on this mitogenomic tree. Raw Illumina reads were mapped against all available sequences to test for species present in local samples. This approach simultaneously established the species richness, phylogenetic composition and community turnover at species and phylogenetic levels. We find a strong signature of vertical structuring in soil fauna that shows high local community differentiation between deep soil and superficial horizons at phylogenetic levels. Within the two vertical layers, turnover among regions was primarily at the tip (species) level and was stronger in the deep soil than leaf litter communities, pointing to layer-mediated drivers determining species diversification, spatial structure and evolutionary assembly of soil communities. This integrated phylogenetic framework opens the application of phylogenetic community ecology to the mesofauna of the soil, among the most diverse and least well-understood ecosystems, and will propel both theoretical and applied soil science
    corecore